Organizational Approaches for Peer-to-Peer based Information Retrieval System
نویسنده
چکیده
In recent years, Peer-to-Peer based information retrieval systems have received significant interest among computer scientists. A peer-to-peer based information retrieval system consists of a set of nodes connected in a peer-to-peer fashion. Each node hosts a document collection to share with other nodes and these nodes work collectively to provide information retrieval service to users. In such a system, an information retrieval task can be considered as a search session during which nodes forward the query to their neighbors, perform local searches and/or return search results. While the promise of this type of applications is attractive, the underlying technology is challenging. The lack of complete, up-to-date information of states of other nodes in the network requires sophisticated strategies for effective distributed search. In addition, the presence of concurrent search sessions adds an another level of complication: nodes may not be able to complete forwarding and perform local searches for all queries they have received in a timely fashion due to bandwidth and processing capacity limitations. This thesis frames a P2P IR problem into a multi-agent framework and attacks it from an organizational perspective by exploring various adaptive, self-organizing topological organizations and designing appropriate coordination strategies for large-scale agent organizations. Specifically, two protocols have been designed to create semantic based implicitly clustered agent organizations and explicit multi-level hierarchical agent organizations respectively in the context of single-query peer-to-peer based information retrieval systems, i.e, each external query is processed until completion before another external query is allowed to enter the system. In forming implicitly semantically-close clusters, agents exchange their resource descriptions to expand their local information about the content distribution over the network. Agents then prune the topology based on predefined rules. Two search strategies were evaluated on the reorganized topology. The experimental results demonstrated that the topology reorganization process combined with a context-aware search algorithm can improve considerably the information retrieval performance. In forming an explicit, multi-level topical hierarchical structure to facilitate locating relevant documents, agents join different groups in the hierarchy based largely on their content similarity. The group formation is achieved by organizing the agent-view structures properly so as to place semantically similar agents together to form explicit groups in an incremental and distributed manner. A context-aware search algorithm is also designed to take advantage of the hierarchical organization. During the search process, agents in the network follow various cooperation strategies to forward queries and return results in the network. …
منابع مشابه
PeerVOIRE - Proposal for a Peer-to-Peer Semantic Information Retrieval System
The exponential increase in available data has led to an ever growing interest in information retrieval techniques. Lexical matching often fails due to synonymy and polysemy. Hence, semantic approaches have been searched for and one of these approaches is Latent Semantic Indexing (LSI). The inconvenient with LSI is its heavy computational needs. Leaving all this demand on one system is impracti...
متن کاملThe Viewpoints of Alborz University of Medical Sciences’ Faculty Members on Open Peer Review of Journal Articles
Background and Aim: The open peer review process, which is one of the peer-reviewed methods in journals, has been accepted in scientific forums. The aim of this study was to investigate the points of view of university faculty members about the open peer review process of journal articles. Materials and Methods: The study used a descriptive survey. The sample size was calculated using the Coch...
متن کاملDistributed Information Retrieval and Automatic Identification of Music Works in SAPIR
The Search in Audio-visual content using Peer-to-peer Information Retrieval (SAPIR) project is an EU IST FP6 research project that aims at developing theories and technologies for the next-generation search techniques, which will effectively and efficiently deliver relevant information from exponentially growing distributed and very dynamic multimedia databases. The SAPIR consortium includes ex...
متن کاملThe Design of PIRS, a Peer-to-Peer Information Retrieval System
We describe the design of PIRS, a peer-to-peer information retrieval system. Unlike some other proposed approaches, PIRS does not require the centralization of data onto specially designated peers. It is therefore applicable to a larger environment. We explain our design decisions, analyzing its benefits and potential shortcomings. We then show that PIRS significantly improves over search perfo...
متن کاملCollection Profiling for Collection Fusion in Distributed Information Retrieval Systems
Discovering resource descriptions and merging results obtained from remote search engines are two key issues in distributed information retrieval studies. In uncooperative environments, query-based sampling and normalizing scores based merging strategies are well-known approaches to solve such problems. However, such approaches only consider the content of the remote database and do not conside...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004